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[82] Nucleotides of polymorphic sites in the extension products were assayed using mass 

spectrometry, MALDI-TOF (Matrix Assisted Laser Desorption and Ionization-Time of 
Flight). The MALDI-TOF is operated according to the following principle. When an 
analyte is exposed to a laser beam, it flies toward a detector positioned at the opposite 
side in a vacuum state, together with an ionized matrix (3-hydroxypicolinic acid). At 
this time, the time taken for the analyte to reach the detector is calculated. A material 
with a smaller mass reaches the detector more rapidly. The nucleotides of SNPs in the 
target DNA fragments are determined based on a difference in mass between the DNA 
fragments and known nucleotide sequences of the SNPs. 

[83] Determination results of nucleotides of polymorphic sites of the target DNAs using 

the MALDI-TOF are shown in Tables 1 and 2 above. Each allele may exist in the form 
of homozygote or heterozygote in an individual. According to Mendel s Law of in- 
heritance and Hardy-Weinberg Law, a genetic makeup of alleles constituting a 
population is maintained at a constant frequency. When the genetic makeup is sta- 
tistically significant, it can be considered to be biologically meaningful. The SNPs 
according to the present invention occur in colorectal cancer patients at a statistically 
significant level, as shown in Tables 1 and 2, and thus, can be efficiently used in 
diagnosis of colorectal cancer. 
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Claims 

[1] A polynucleotide comprising at least 10 contiguous nucleotides of a nucleotide 

sequence selected from the group consisting of nucleotide sequences of SEQ ED 

NOS: 1-12 and comprising a nucleotide at position 101 of the nucleotide 

sequence, or a complementary polynucleotide thereof. 
[2] A polynucleotide which is hybridized with the polynucleotide of claim 1 or the 

complementary polynucleotide thereof. 
[3] The polynucleotide of claim 1 or 2, which is 10 to 100 nucleotides in length, or 

the complementary polynucleotide thereof. 
[4] The polynucleotide of claim 1 , which is a primer or a probe. 

[5] A microarray for diagnosis of colorectal cancer, which comprises the 

polynucleotide of claim 1 or the complementary polynucleotide thereof. 
[6] A kit for diagnosis of colorectal cancer, which comprises the polynucleotide of 

claim 1 or the complementary polynucleotide thereof. 
[7] A method of diagnosing colorectal cancer in an individual, which comprises: 

isolating a nucleic acid sample from the individual; and 

determining a nucleotide of at least one polymorphic site (position 101) within 
polynucleotides of SEQ ID NOS: 1-12 or complementary polynucleotides 
thereof. 

[8] The method of claim 7, wherein the operation of determining the nucleotide of 

the at least one polymorphic site comprises: 

hybridizing the nucleic acid sample onto a microarray on which the 
polynucleotide of claim 1 or its complementary polynucleotide is immobilized; 
and 

detecting a hybridization result. 
[9] The method of claim 7, wherein when at least one selected from the group 

consisting of A, A, C, G, G, T, G, C, G, G, A, and A which are respective risk 
alleles of the polynucleotides of SEQ ID NOS: 1-12 is detected, it is determined 
that the individual has a higher likelihood of being diagnosed as at risk of 
developing colorectal cancer. 
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nucleotide (e.g., A allele). In this regard, when the first allele (e.g., A allele) exists in 
the target DNA fragments, products in which only a nucleotide (e.g., T nucleotide) 
complementary to the first allele extended from the primers will be obtained. On the 
other hand, when a second allele (e.g., G allele) exists in the target DNA fragments, a 
nucleotide (e.g., C nucleotide) complementary to the second allele is added to the 3 - 
ends of the primers and then the primers are extended until a nucleotide com- 
plementary to the closest first allele nucleotide (e.g., T nucleotide) is added. The 
lengths of products extended from the primers were determined by mass spectrometry. 
Therefore, alleles present in the target DNA fragments could be identified. Illustrative 
experimental conditions were as follows. 

First, unreacted dNTPs were removed from the PGR products. For this, 1.53D of 
pure water, 0.1 7D of HME buffer, 0.30 D of shrimp alkaline phosphatase (SAP) were 
added and mixed in 1 .5 ml tubes to prepare SAP enzyme solutions. The tubes were 
centrifuged at 5,000 rpm for 10 seconds. Thereafter, the PCR products were added to 
the SAP solution tubes, sealed, incubated at 37°C for 20 minutes and then 85°C for 5 
minutes, and stored at 4°C. 

Next, homogeneous extension was performed using the target DNA fragments as 
templates. The compositions of reaction solutions for the extension were as follows. 

Water (nanoscale pure water) 1.728D 

hME extension mix (lOxbuffer containing 2.25 mM d/ddNTPs) 0.200D 
Extension primers (lOOjiM for each) 0.054D 
Thermosequenase (32U/D) 0.01 8D 
Total volume 2.00D 

The reaction solutions were thoroughly mixed with the previously prepared target 
DNA solutions and subjected to spin-down centrifugation. Tubes or plates containing 
the resultant mixtures were compactly sealed and incubated at 94°C for 2 minutes, 
followed by 40 cycles at 94°C for 5 seconds, at 52°C for 5 seconds, and at 72°C for 5 
seconds, and storage at 4°C. The homogeneous extension products thus obtained were 
washed with a resin (SpectroCLEAN). Extension primers used in the extension are 
listed in Table 3 below. 

Table 3 

Primers for amplification and extension primers for homogeneous extension for target 
DNAs 



Marker 


Amplification primer (SEQ ID NO.) 


Extension primer 
(SEQ ID NO.) 


Forward primer 


Reverse primer 
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selected SNPs were used to assay nucleotides of SNPs in the DNA samples. 

1. Preparation of DNA samples 

DNA samples were extracted from blood streams of colorectal cancer patients and 
normal persons. DNA extraction was performed according to a known extraction 
method (Molecular cloning: A Laboratory Manual, p 392, Sambrook, Fritsch and 
Maniatis, 2nd edition, Cold Spring Harbor Press, 1989) and the specification of a 
commercial kit manufactured by Centra system. Among extracted DNA samples, only 
DNA samples having a purity (measured by A^/A^nm ratio) of at least 1 .6 were 
used. 

2. Amplification of target DNAs 

Target DNAs, which are predetermined DNA regions containing SNPs to be 
analyzed, were amplified by PCR. The PCR was performed by a common method as 
the following conditions. First, target genomic DNAs were diluted to concentration 2.5 
ng/ml. Then, the following PCR mixture was prepared. 

Water (HPLC grade) 2.240 

lOx buffer (15 mM MgCl 2 , 25 mM MgCy 0.50 

dNTP Mix (GIBCO) (25 mM for each) 0.040 

Taq pol (HotStar) (5U/D) 0.020 

Forward/reverse primer Mix (1 \iM for each) 0.020 

DNA 1.000 

Total volume 5.000 

Here, the forward and reverse primers were designed based on upstream and 
downstream sequences of SNPs in known database. These primers are listed in Table 3 
below. 

The condition of PCR were as follows: incubation at 95°C for 15 minutes; de- 
naturation at 95°C for 30 seconds, annealing at 56°C for 30 seconds, and extension at 
72°C for 1 minute and these are repeated 45 times; and finally incubation at 72°C for 3 
minutes and storage at 4°C. As a result, amplified target DNA fragments which were 
200 or less nucleotides in length were obtained. 

3. Analysis of nucleotides of SNPs in amplified target DNA fragments 
Analysis of the nucleotides of SNPs in the amplified target DNA fragments was 

performed using a homogeneous MassEXTEND (hME) technique available from 
Sequenom. The principle of the MassEXTEND technique is as follows. First, primers 
(also called as extension primers ) ending immediately one base before SNPs within 
the target DNA fragments were designed. Then, the primers were hybridized with the 
target DNA fragments and DNA polymerization was initiated. At this time, a poly- 
merization solution contained a reagent (e.g., ddTTP) terminating the polymerization 
immediately after the incorporation of a nucleotide complementary to a first allelic 
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According to an embodiment of the present invention, the operation of determining 
the nucleotide (n) of the at least one polymorphic site includes hybridizing the nucleic 
acid sample onto a microarray on which polynucleotides for diagnosis or treatment of 
colorectal cancer, including at least 10 contiguous nucleotides derived from the group 
consisting of nucleotide sequences of SEQ ID NOS: 1-12 and including a nucleotide of 
a polymorphic site (position 101), or complementary polynucleotides thereof are im- 
mobilized; and detecting the hybridization result. 

A microarray and a method of preparing a microarray by immobilizing a probe 
polynucleotide on a substrate are well known in the pertinent art. Immobilization of a 
probe polynucleotide associated with colorectal cancer of the present invention on a 
substrate can be easily performed using a conventional technique. Hybridization of 
nucleic acids on a microarray and detection of the hybridization result are also well 
known in the pertinent art. For example, the detection of the hybridization result can be 
performed by labeling a nucleic acid sample with a labeling material generating a 
detectable signal, such as a fluorescent material (e.g., Cy3 and Cy5), hybridizing the 
labeled nucleic acid sample onto a microarray, and detecting a signal generated from 
the labeling material. 

Effect of the Invention 

The polynucelotide according to the present invention can be used for diagnosis, 
treatment, or fingerprinting analysis of colorectal cancer. 

The microarray and diagnostic kit including the polynucleotide according to the 
present invention can be used for efficient diagnosis of colorectal cancer. 

The method of analyzing polynucleotides associated with colorectal cancer 
according to the present invention can efficiently detect the presence or a risk of 
colorectal cancer. 

BEST MODE FOR CARRYING OUT THE INVENTION 

Hereinafter, the present invention will be described more specifically by Examples. 
However, the following Examples are provided only for illustrations and thus the 
present invention is not limited to or by them. 

Examples 

Example 1 

In this Example, DNA samples were extracted from blood streams of a patient 
group consisting of 300 Korean men and women that had been diagnosed as colorectal 
cancer patients and had been being under treatment and a normal group consisting of 
300 Korean men and women free from symptoms of colorectal cancer patient group, 
and occurrence frequencies of specific SNPs were evaluated. The SNPs were selected 
from a known database (NCBI dbSNP:http://www.ncbi.nlm.nih.gov/SNP/) or 
(Sequenom:http://www.realsnp.com/). Primers hybridizing with sequences around the 



WO 2005/079173 



10 



PCT/KR2005/000465 



derived from the other member. In this case, hybridization conditions should be s 
tringent enough to allow hybridization with only one allele by significant difference in 
hybridization strength between alleles. Preferably, the central portion of the probe, that 
is, position 7 for a 15 nucleotide probe, or position 8 or 9 for a 16 nucleotide probe, is 
aligned with each polymorphic site of the nucleotide sequences of SEQ ID NOS: 1-12. 
Therefore, there may be caused a significant difference in hybridization between 
alleles. The probe of the present invention can be used in diagnostic methods for 
detecting alleles. The diagnostic methods include nucleic acid hybridization-based 
detection methods, e.g., southern blot. In a case where DNA chips are used for the 
nucleic acid hybridization-based detection methods, the probe may be provided as an 
immobilized form on a substrate of a DNA chip. 

[43] The present invention also provides a microarray for diagnosis of colorectal cancer, 

including the polynucleotide according to the present invention or the complementary 
polynucleotide thereof. The polynucleotide of the microarray may be DNA or RNA. 
The microarray is the same as a common microarray except that it includes the 
polynucleotide of the present invention. 

[44] The present invention also provides a colorectal cancer diagnostic kit including the 

polynucleotide of the present invention. The colorectal cancer diagnostic kit may 
include reagents necessary for polymerization, e.g., dNTPs, various polymerases, and a 
colorant, in addition to the polynucleotide according to the present invention. 

[45] The present invention also provides a method of diagnosing colorectal cancer in an 

individual, which includes: isolating a nucleic acid sample from the individual; and de- 
termining a nucleotide (n) of at least one polymorphic site (position 101) within 
polynucleotides of SEQ ID NOS: 1-12 or complementary polynucleotides thereof. 
Here, when the nucleotide of the at least one polymorphic site of the sample nucleic 
acid is the same as at least one risk allele presented in Tables 1 and 2, it is detennined 
that the individual has a higher likelihood of being diagnosed as at risk of developing 
colorectal cancer. 

[46] The operation of isolating the nucleic acid sample from the individual may be 

carried out by a common DNA isolation method. For example, the nucleic acid sample 
can be obtained by amplifying a target nucleic acid by polymerase chain reaction 
(PCR) followed by purification. In addition to PCR, there may be used LCR (Wu and 
Wallace, Genomics 4, 560 (1989), Landegren et aL, Science 241, 1077 (1988)), tran- 
scription amplification (Kwoh et aL, Proc. Natl Acad ScL USA 86, 1 173 (1989)), self- 
sustained sequence replication (Guatelli et aL, Proc. Natl Acad. ScL USA 87, 1874 
(1990)), or nucleic acid sequence based amplification (NASBA). The last two methods 
are related with isothermal reaction based on isothermal transcription and produce 30 
or 100-fold RNA single strands and DNA double strands as amplification products. 
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of colorectal cancer, which is hybridized with a polynucleotide including at least 10 
contiguous nucleotides containing the nucleotide of a polymorphic site of a nucleotide 
sequence selected from the group consisting of the nucleotide sequences of SEQ ID 
NOS: 1-12, or a complement thereof. 

[39] The allele-specific polynucleotide refers to a polynucleotide specifically hybridized 

with each allele. That is, the allele-specific polynucleotide has the ability that dis- 
tinguishes nucleotides of polymorphic sites within the polymorphic sequences of SEQ 
ID NOS: 1-12 and specifically hybridizes with each of the nucleotides. The hy- 
bridization is performed under stringent conditions, for example, conditions of 1M or 
less in salt concentration and 25°C or more in temperature. For example, conditions of 
5xSSPE (750mM NaCl, 50mM Na Phosphate, 5mM EDTA, pH 7.4) and 25-30°C are 
suitable for allele-specific probe hybridization. 

[40] In the present invention, the allele-specific polynucleotide may be a primer. As used 

herein, the term "primer" refers to a single stranded oligonuleotide that acts as a 
starting point of template-directed DNA synthesis under appropriate conditions, for 
example in a buffer containing four different nucleoside triphosphates and polymerase 
such as DNA or RNA polymerase or reverse transcriptase and an appropriate 
temperature. The appropriate length of the primer may vary according to the purpose 
of use, generally 15 to 30 nucleotides. Generally, a shorter primer molecule requires a 
lower temperature to form a stable hybrid with a template. A primer sequence is not 
necessarily completely complementary with a template but must be complementary 
enough to hybridize with the template. Preferably, the 3 end of the primer is aligned 
with a nucleotide (n) of each polymorphic site of SEQ ID NOS: 1-12. The primer is 
hybridized with a target DNA containing a polymorphic site and starts an allelic am- 
plification in which the primer exhibits complete homology with the target DNA. The 
primer is used in pair with a second primer hybridizing with an opposite strand. 
Amplified products are obtained by amplification using the two primers, which means 
that there is a specific allelic form. The primer of the present invention includes a 
polynucleotide fragment used in a ligase chain reaction (LCR). 

[41] In the present invention, the allele-specific polynucleotide may be a probe. As used 

herein, the term "probe" refers to a hybridization probe, that is, an oligonucleotide 
capable of sequence-specifically binding with a complementary strand of a nucleic 
acid. 

[42] Such a probe may be a peptide nucleic acid as disclosed in Science 254, 1497-1500 

(1991) by Nielsen et al. The probe according to the present invention is an allele- 
specific probe. In this regard, when there are polymorphic sites in nucleic acid 
fragments derived from two members of the same species, the probe is hybridized with 
DNA fragments derived from one member but is not hybridized with DNA fragments 
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significant difference between the case group and the normal group. 

- At the risk allele column, when a reference allele is A2 and the allele A2 
frequency of the case group is larger than the allele A2 frequency of the normal group 
(i.e., cas_A2>con_A2), the allele A2 is regarded as risk allele. In an opposite case, 
allele Al is regarded as risk allele. 

- Odds ratio represents the ratio of the probability of risk allele in the case group to 
the probability of risk allele in the normal group. In the present invention, the Mantel- 
Haenszel odds ratio method was used. CI represents 95% confidence interval for the 
odds ratio and is represented by (lower limit of the confidence interval, upper limit of 
the confidence interval). When 1 falls under the confidence interval, it is considered 
that there is insignificant association of risk allele with disease. 

- HWE represents that the result satisfied Hardy- Weinberg Equilibrium. Here, 
con_HWE and cas_HWE represent degree of deviation from the Hardy- Weinberg 
Equilibrium in the normal group and the case group, respectively. Based on 
chi_varue=6.63 (p-value=0.01, df=l) in a chi-square (df=l) test, a value larger than 
6.63 was regarded as Hardy- Weinberg Disequilibrium (HWD) and a value smaller than 
6.63 was regarded as Hardy- Weinberg Equilibrium (HWE). 

- Call rate represents the number of genotype-interpretable samples to the total 
number of samples used in experiments. Here, cas_call_rate and con_call_rate 
represent the ratio of the number of genotype-interpretable samples to the total number 
(300 persons) of samples used in the case group and the normal group, respectively. 

- rs represents SNP identification number in NCBI dbSNP. 

Tables 1 and 2 present characteristics of SNP markers based on the NCBI build 119 
(February 1,2005). 

As shown in Tables 1 and 2, according to the chi-square test of the polymorphic 
markers of SEQ ID NOS: 1-12 of the present invention, chi_exact__p-value ranges from 
0.0000008 to 0.049 in 95% confidence interval. This shows that there are significant 
differences between expected values and measured values in allele occurrence 
frequencies in the polymorphic markers of SEQ ID NOS: 1-12. Odds ratio ranges from 
1.30 to 2.06, which shows that the polymorphic markers of SEQ ID NOS: 1-12 are 
associated with colorectal cancer. 

Therefore, the polynucleotide according to the present invention can be efficiently 
used in diagnosis, fingerprinting analysis, or treatment of colorectal cancer. In detail, 
the polynucleotide of the present invention can be used as a primer or a probe for 
diagnosis of colorectal cancer. Furthermore, the polynucleotide of the present 
invention can be used as antisense DNA or a composition for treatment of colorectal 
cancer. 

The present invention also provides an allele-specific polynucleotide for diagnosis 
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[23] In Tables 1 and 2, the contents in columns are as defined below. 

[24] - AssayJD represents a marker name. 

[25] - SNP is a polymorphic base of a SNP polymorphic site. Here, Al and A2 represent 

a low mass allele and a high mass allele, respectively, as a result of sequence analysis 
according a homogeneous MassEXTEND (hME) technique (Sequenom) and are 
optionally designated for convenience of experiments. 

[26] - SNP sequence represents a sequence containing a SNP site, i.e., a sequence 

containing allele Al or A2 at position 101. 

[27] - At the allele frequency column, cas_A2, con_A2, and Delta respectively represent 

allele A2 frequency of a case group, allele A2 frequency of a normal group, and the 
absolute value of the difference between cas_A2 and con_A2. Here, cas_A2 is 
(genotype A2A2 frequency x2 + genotype Al A2 frequency )/(the number of samples x 
2) in the case group and con_A2 is (genotype A2A2 frequency x 2 + genotype Al A2 
frequency )/(the number of samples x 2) in the normal group. 

[28] - Genotype frequency represents the frequency of each genotype. Here, cas_Al Al, 

cas_AlA2, and cas_A2A2 are the number of persons with genotypes Al Al, A1A2, 
and A2A2, respectively, in the case group, and con_AlAl, con_AlA2, and con_A2A2 
are the number of persons with genotypes A1A1, A1A2, and A2A2, respectively, in 
the normal group. 

[29] - df=2 represents a chi-squared value with two degree of freedom. Chi-value 

represents a chi-squared value and p- value is determined based on the chi-value. 
Chi_exact_p- value represents p-value of Fisher s exact test of chi-square test. When 
the number of genotypes is less than 5, results of the chi-square test may be inaccurate. 
In this respect, determination of more accurate statistical significance (p-value) by the 
Fisher s exact test is required. The chi_exact_p-value is a variable used in the Fisher s 
exact test. In the present invention, when the p-value<0.05, it is considered that the 
genotype of the case group is different from that of the normal group, i.e., there is a 
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Table 2 

Characteristics of the polymorphic sequences of SEQ ID NOS: 1-12 
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polynucleotide preferably is 10 to 200 nucleotides in length, more preferably 10 to 100 
nucleotides in length, and still more preferably 10 to 50 nucleotides in length. 

[19] Each of the nucleotide sequences of SEQ ID NOS: 1-12 is a polymorphic sequence. 

The polymorphic sequence refers to a nucleotide sequence containing a polymorphic 
site at which single-nucleotide polymorphism (SNP) occurs. The polymorphic site 
refers to a position of the polymorphic sequence at which SNP occurs. The nucleotide 
sequences may be DNAs or RNAs. 

[20] In the present invention, each polymorphic site (position 101) of the polymorphic 

sequences of SEQ ID NOS: 1-12 is associated with colorectal cancer. This is 
confirmed by DNA nucleotide sequence analysis of blood samples from colorectal 
cancer patients and normal persons. The analysis results are summarized in Tables 1 
and 2. 

[21] Table 1 

Association of the polymorphic sequences of SEQ ID NOS: 1-12 with colorectal 
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applied for most patients suffering colorectal cancer. 

The most serious problem in early diagnosis or prognosis of various cancers and 
complicated diseases, including colorectal cancer, is that the diagnosis or prediction 
could be performed by a physical technique when the cancers and complicated diseases 
are at an advanced stage. However, the developments of recent various molecular 
biological techniques and the preliminary completion of the human genome project 
enable finding of genes or genetic variations directly/indirectly related to a disease. 
Therefore, early diagnosis that predicts the incidence of a disease using a genetic 
factor, instead of using a conventional phenotype- or phenotypic disease-dependent 
diagnostic method, becomes available. Currently, biochemical or molecular biological 
techniques are available for colorectal cancer diagnosis. Due to the lack of information 
about genes or genetic variations related to the cancer and correlation between the 
genes or genetic variations and colorectal cancer incidence rate, early diagnosis of a 
desired level for both patients and doctors is not made in case of colorectal cancer 
diagnosis using molecular biological techniques. Additionally, in most diagnosis cases 
using a single biological marker, it is common that the sensitivity and specificity of the 
marker are not satisfied at the same time. Generally, if sensitivity is high, specificity is 
low, and vice versa. For this reason, the possibility to occur error in diagnosis is high 
so that it is difficult to accomplish accuracy of a desired level. Therefore, a single 
biological marker is used simply as diagnostic markers of preliminary screening for 
precise examinations. 

DETAILED DESCRIPTION OF THE INVENTION 

Technical Goal of the Invention 

The present invention provides a polynucleotide containing single-nucleotide 
polymorphism associated with colorectal cancer. 

The present invention also provides a microarray and a colorectal cancer diagnostic 
kit, each of which includes the polynucleotide containing single-nucleotide 
polymorphism associated with colorectal cancer. 

The present invention also provides a method of diagnosing a colorectal cancer 
using polynucleotides associated with colorectal cancer. 

Disclosure of the Invention 

The present invention provides a polynucleotide including at least 10 contiguous 
nucleotides of a nucleotide sequence selected from the group consisting of nucleotide 
sequences of SEQ ID NOS: 1-12 and including a nucleotide of a polymorphic site 
(position 101) of the nucleotide sequence, or a complementary polynucleotide thereof. 

The polynucleotide includes at least 10 contiguous nucleotides containing the 
nucleotide (expressed by n ) of a polymorphic site (position 101) of a nucleotide 
sequence selected from the nucleotide sequences of SEQ ID NOS: 1-12. The 
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Monoclonal antibodies specifically binding with the SNPs can also be used in 
diagnosis of the disease. Currently, research into the nucleotide sequences and 
functions of SNPs is under way by many research institutes. The nucleotide sequences 
and other experimental results of the identified human SNPs have been made into 
database to be easily accessible. 

[5] Even though findings available to date show that specific SNPs exist on human 

genomes or cDNAs, phenotypic effects of SNPs have not been revealed. Functions of 
most SNPs have not been disclosed yet except a small numbers of SNPs. 

[6] Colorectal cancer is a cancer that is very common in worldwide including Korea. In 

Korea, colorectal cancer is the fourth common cancer in both men and women. 
Colorectal cancer ranks fourth among cause of death by cancer and is responsible for 
about seven deaths per hundred thousand populations. Over the last 10 years, the death 
rate for colorectal cancer is increasing by about 80%. 

[7] It is known that the incidence of colorectal cancer is mainly caused by an envi- 

ronmental factor. Rapid westernization of diet and excess intake of animal fat or 
protein are major factors in the development of colorectal cancer. However, it is 
known that about 5% of colorectal cancer cases occur by a genetic cause. 

[8] More than 90% of colorectal cancer patients are those who are over 40 years of age. 

It is known that the incidence of colorectal cancer is more frequent in people (high risk 
group) with familial history related to colorectal cancer, inflammatory bowel disease, 
colonic polyp, ovarian cancer, uterine cancer, and breast cancer, in addition to people 
aged over 40 years. The incidence of colorectal cancer in young people with 30-40 
ages is mainly dominated by a genetic cause. 

[9] Early detection of colorectal cancer ensures almost 100% cure rate. Generally, i' 1 ■ a 

however, since colorectal cancer has no specific early symptoms, early detection is 
difficult. A fecal occult blood test (for detecting trace amounts of blood in the stool) is 
generally used as a screening test to detect colorectal cancer, in particular when the 
cancer is not causing any symptoms. The fecal occult blood test is a method for 
selecting persons for an additional precision examination. However, since this test has 
a high rate of false-positive results and false-negative results, it is not suitable for early 
diagnosis. Currently, exact diagnosis of colorectal cancer is made by barium enema ex- 
amination, endoscopy, radiation examination, and the like. A tumor marker called as 
CEA (carcinoembryonic antigen) is generally used to determine a developmental stage 
of colorectal cancer and to evaluate a therapeutic effect for colorectal cancer. But, still 
there are no universally recognized and verified tumor markers that enable early 
diagnosis or prediction of colorectal cancer through blood test. Several markers for 
screening or early diagnosis of patients belonging to high-risk groups who are 
susceptible to colorectal cancer are reported, but these markers have a limitation to be 
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Description 

A POLYNUCLEOTIDE ASSOCIATED WITH A COLON 
CANCER COMPRISING SINGLE NUCLEOTIDE 
POLYMORPHISM, MICROARRAY AND DIAGNOSTIC KIT 
COMPRISING THE SAME AND METHOD FOR DIAGNOSING 
A COLON CANCER USING THE POLYNUCLEOTIDE 
Technical Field 

[1] The present invention relates to a polynucleotide associated with colorectal cancer, 

a microarray and a diagnostic kit including the same, and a method of analyzing 
polynucleotides associated with colorectal cancer. 
Background Art 

[2] The genomes of all organisms undergo spontaneous mutation in the course of their 

continuing evolution, generating variant forms of progenitor nucleic acid sequences 
(Gusella, Ann. Rev. Biochem. 55, 831-854 (1986)). The variant forms may confer an 
evolutionary advantage or disadvantage, relative to a progenitor form, or may be 
neutral. In some instances, a variant form confers a lethal disadvantage and is not 
transmitted to subsequent generations of the organism. In other instances, a variant 
form confers an evolutionary advantage to the species and is eventually permanently 
incorporated into the DNA of most members of the species and effectively becomes 
the progenitor form. In many instances, both progenitor and variant form(s) survive 
and co-exist in a population of species. The coexistence of multiple forms of a 
sequence gives rise to polymorphisms. 

[3] Among polymorphisms, several types have been known, including restriction 

fragment length polymorphisms (RFLPs), short tandem repeats (STRs), variable 
number tandem repeats (VNTRs) and single-nucleotide polymorphisms (SNPs). 
Among them, SNPs take the form of single-nucleotide variations between individuals 
of the same species. When SNPs occur in protein coding sequences, some of the 
polymorphic forms may give rise to the non-synonymous change of amino acid 
causing expression of a defective or a variant protein. On the other hand, when SNPs 
occur in non-coding sequences, e.g., within intron, some of these polymorphisms may 
result in splicing variant of mRNA causing the expression of defective or variant 
proteins, too. Other SNPs could have no phenotypic effect at all. 

[4] It is estimated that human SNPs occur at a frequency of 1 in every 1,000 bp. When 

such SNPs influence the phenotypic expression such as a disease, polynucleotides 
containing the SNPs can be used as primers or probes for diagnosis of the disease. 



